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Analyses of peculiar velocity surveys face several challenges, including low signal-to-noise 
in individual velocity measurements and the presence of small-scale, nonlinear flows. I will 
present three new analyses that attempt to address these inherent problems. The first is geared 
towards the better understanding of the estimated errors in the surveys, specifically sampling 
errors, and the resolution of the seeming disagreements between the surveys. Another develops 
a new statistic that does not suffer from the usual problems and gives robust results that are 
galaxy-morphology and distance-estimator independent. The third introduces a formalism 
that allows for the accounting of most of the non-linear signal whereby the signal to noise is 
increased and small-scale aliasing is removed. 



1 Introduction 



The best developed class of theories of structure formation is based on the assumption of small 
amplitude Gaussian density perturbations which grew and condensed out, becoming, by gravita- 
tional instability, galaxies, clusters of galaxies, superclusters or voids. The study of the evolution 
of density perturbations is the basis of all theories based on gravitational instability 1 ' 2 . Recently, 
this class of theories was given a dramatic boost by the detection of appropriate temperature 
fluctuations in the cosmic microwave background. Further, recent work on the bispectrum mea- 
surement in the IRAS redshift catalogs 3 shows strong signature of gravitational instability. 

Statistical studies of the peculiar velocity field (those velocities in excess of the pure Hubble 
flow) based on the predictions of the gravitational instability scenario can potentially address 
many fundamental cosmological questions. Although in principle the galactic peculiar (local) 
velocity field holds a great promise as a direct probe of the underlying mass distribution, in 
practice the extraction of the information is difficult and fraught with both observational and 
theoretical pitfalls. Velocity surveys have irregular geometries and their boundary conditions are 
not usually well-known. Further, they are somewhat shallow and sample the volume discretely, 
non-uniformly and sparsely. Theoretically, the mapping from velocity to density is complicated 



by non-linear effects which necessitates various approximation schemes . Indeed, small-scale 
aliasing and incomplete cancellations 5,6 introduce spurious noise which masquerades as large 
scale signal. These effects are difficult to disentangle and thus the resulting information is 
unreliable 7 ' 8 . The bottom line with respect to peculiar velocity surveys is that it is very difficult 
to extract cosmological information since the large-scale signal has a significant small-scale noise 
which is virtually impossible to model accurately. 

Estimating cosmological parameters may be done by studying the large scale motions of 
galaxies. One advantage of this method is that the large scale velocity field probes the matter 
distribution in the Universe directly, and not merely the light distribution as redshift surveys 
do. However, to measure the velocity field one needs to make accurate distance measurements, 
which has proven to be quite difficult 9 . The errors in distance estimates are typically some 
fraction of the redshift of the sample points, which in the case of distant objects can mean that 
the errors are larger than the peculiar velocity being measured. This is partially rectified by 
measuring only the lowest moment of the velocity field, namely the bulk flow. Since the bulk 
flow is in a sense an average of the velocities in the sample, its error is reduced over that of 
an individual measurement by the square root of the number of objects 10 ' n . The idea here 
is that in calculating low-order moments the small scale modes will be averaged out, so that 
the values of these moments will reflect only large-scale motion. It has been shown, however, 
that the sparseness of peculiar velocity data can lead to small-scale modes making a significant 
contribution to low-order moments through incomplete cancellation 7,6 . Another drawback of 
this approach is that it utilizes only a fraction of the available information. 

An alternative method is to perform a likelihood analysis using all of the velocity informa- 
tion 12 . An obvious danger here is that retaining small-scale, nonlinear contributions to the 
velocities can lead to unpredictable biases which can skew the results 13 . This method also has 
the disadvantage of becoming unwieldy for surveys larger than about a thousand objects. While 
advances in computing will make this less of a problem in the future, clearly a less time-intensive 
method is desirable. 

The non-linear aliasing and incomplete cancellations inherent in the surveys effectively defied 
all attempts to extract robust cosmological information from the data. The remedies proposed 
in the literature, POTENT 14 ; denser directional surveys 6 and others where not satisfactory. 
Smoothing over the problems by introducing some averaging schemes did not work consistently. 
Smoothing, including Weiner Filtering 15 provide formalisms that may or may not remove small 
scale irregularities. The problem is that since smoothing operations provides little or no control 
over what we remove and what we keep, that it is impossible to state with any degree of 
certainty whether the smoothed field retains the large scale signal while removing the potentially 
significant small scale noise. 

Here I would like to discuss three programs that have been developed to address different 
aspects of the problems discussed above. The first attempts to better understand the error 
estimations inherent in velocity field surveys and to develop a scheme to compare them and see 
if the surveys themselves are compatible with each other within their errors and with the power 
spectrum of density fluctuations derived by other techniques. Another searches for a statistic 
that is robust and does not depend on the particular distance indicator or morphology. The 
last tries to deal with the problems head on, that is, to look for a statistical way to rid the 
surveys of their non-linearities in such a way that the large-scale signal remains intact whereas 
the small-scale "noise" is being removed. 

One idea proposed is to develop a more realistic error estimation for existing surveys, in 
particular, the understanding of the sampling errors 16 allows for consistent bulk flow estimates 
for most of the bulk flow measurements within the errors (see Table 1 below) . Additionally, we 
get an idea of how much correlation we expect between the different catalogs for a given power 
spectrum by calculating the normalized expectation value for their dot-product which should 



be close to 1 for highly correlated surveys, zero for those that are completely uncorrelated, and 
— 1 if there is a high degree of anti-correlation 5 > 17 > 18 . As we see elsewhere in this volume 16 the 
bulk flow measurements of various independent surveys are fairly correlated and a reasonably 
consistent bulk flow vector emerges from them that is 380 ± 80km/s towards I = 290° and 
b = 10°. 

Recent Large Scale Bulk Flow Measurements 
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Table 1: When we take into account both the total error, i.e. random and sampling errors, most bulk flow 
measurements agree with each other. For a more detailed analysis see Hudson, M., 2003, In this volume. 

Another program that seeks to address the inconsistencies between various velocity surveys 
results develops a statistic that is not as susceptible to small-scale aliasing and incomplete 
cancellations. In series of recent papers we introduced a new dynamical estimator of the £l m 
parameter, the dimensionless density of the nonrelativistic matter in the universe. We use the so 
called streaming velocity, or the mean relative peculiar velocity for galaxy pairs, vi2(r), where r 
is the pair separation 1 . It is measured directly from peculiar velocity surveys, without the noise- 
generating spatial differentiation, used in reconstruction schemes, like POTENT (see Courteau 
et al. 2000 and references therein). In the first paper of the series 24 , we derived an equation, 
relating vu(r) to fi m and the two-point correlation function of mass density fluctuations, £(r). 
Then, we showed that vu and Q rn can be estimated from mock velocity surveys 25 , from real 
data: the Mark III survey 26 and finally 27 a comparison of various independent surveys that 
use different standard candles, galaxy morphologies and surveying techniques. Whenever a new 
statistic is introduced, it is of particular importance that it passes the test of reproducibility. 
Our results pass these tests: the v±2(r) measurements are galaxy morphology- and distance 
indicator-independent 28 . 

Using mean relative peculiar velocity measurements for pairs of galaxies, we estimate the 
cosmological density parameter tt m and the amplitude of density fluctuations as- Our results 
suggest that our statistic is a robust and reproducible measure of the mean pairwise velocity 
and thereby the £l m parameter. We get f2 m = 0.30to;o7 and as = 1.13lo;ii- These estimates 
do not depend on prior assumptions on the adiabaticity of the initial density fluctuations, the 
ionization history, or the values of other cosmological parameters. 

In figure 1 27 we see that the claim that the v±2 statistic is independent of the galaxy 
morphology or the standard candle is correct. We show here four independent surveys, The 
Mark III 29 , a TF compilations of 2437 spiral galaxies. The SFI 30 catalogue contains 1300 late 
type spiral galaxies with /-band TF distance estimates. The ENEAR 31 survey with 1359 early 
type with D n -a measured distances. The RFGC 32 catalogue provides a list of radial velocities, 
HI line widths, TF distances and peculiar velocities of 1327 spiral galaxies that was compiled 
from observations of flat galaxies performed with the 305 m telescope at Arecibo. As can clearly 
be seen, all measurements agree with each other and thus allows us to combine the results and 
obtain a mean v\2{r) with smaller errors than for each individual survey. Further, we see that 
since all surveys give us identical results within the errors, ther is no sign for velocity bias. 
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Figure 1: (left panel) The crosses and the associated error bars show the weighted mean pairwise velocity, ob- 
tained by averaging over four surveys. Individual survey data points are also shown; we have suppressed their 
error bars for clarity. These direct measurements of W12 are compared to four Vi2{r) curves, derived by assuming 
four different models of £(r). The labels identify best fit Q m and as parameters. 

Figure 2: (right panel) The results of the maximum likelihood analysis. The upper panels show results for power- 
law toy models, while the bottom panels are based on realistic representations of observations: the APM and 
PSCz data, respectively. Likelihood peak coordinates and the values of \ 2 f° r each model are also indicated. The 
innermost contours define the 68%, or l-cr areas around the peaks. The remaining nested contours show the 2, 3 
and 4-<7 boundaries. 



In figure 2 we see the results of our estimates for the parameters, O m and erg. When we use a 
correlation function obtained observationally, such as from the IRAS-PSCz survey 33 or the one 
from the APM 3 we get results as those quoted above. However, if we use a correlation function 
£(r) oc r~ 18 (a seeming favourite in some quarters, through it is not compatible with either the 
PSCz or the APM surveys) we get very small Q m and very large a%. Thus this statistic can also 
act as a regularizing diagnostic for the form the correlation function takes around the scale it is 
sensitive (on the order of 10/i _1 Mpc). 

The last program I present is one that is designed to remove the non-linear signal from 
the survey. The method utilizes Karhunen-Loeve methods of data compression to construct a 
set of moments out of the velocities which are minimally sensitive to small scale power; these 
moments can then be used in a likelihood analysis. Overall sensitivity of the set of moments 
to small scales is quantified, and can be controlled through the number of moments retained in 
the analysis. Since the number of moments kept is typically much smaller than the number of 
velocities in the survey, this method has the added advantage of being much more efficient than 
a full analysis of the data. 

Karhunen-Loeve methods 35 ' 36 have recently become popular in cosmology. A general dis- 
cussion of their use in the analysis of large data sets was done here 37 . In addition, these methods 
have been applied to the Las Campanas Redshift Survey 38 , to velocity field surveys 39 , and to 
the decorrelation of the power spectrum 40 ' 41 . Although we use the same general method, our 
take on the formalism is quite different. Taking advantage of the compression techniques and 
the Fisher information matrix, we filter out small-scale, nonlinear velocity modes and retain 
only information regarding the large-scale modes. 

Technical details of the formalism are given elsewhere 42 > 43 > 44 . Here I would only like to 
present some of the results. The formalism allows the diagonalization of the covariance matrix 
with eigenvalues whose amplitudes are proportional to the sensitivity to small-scale modes. A 
set of optimal moments constructed as linear combinations of velocities which are minimally 
sensitive to small scales. The overall sensitivity of a set of moments to small scales can be 
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Figure 3: (left panel) The window functions in arbitrary units. The top panel shows the window functions as- 
sociated with the five smallest eigenvalues. The center panel shows the window functions associated with five 
somewhat larger eigenvalues, while the bottom panel shows the window functions selected from the entire range 
of large eigenvalues. For each panel we show the eigenvalue rank for each window function. 

Figure 4: (right panel) Likelihood vs. V and f3 for the simulated catalog for different N' , the number of moments 
kept; we also show the value of A9 q . The panels show the maximum likelihood value (solid triangles) and the 
contours corresponding to 0.5, 0.1 and 0.01 of the maximum values. The asterisk in each panel is the "true" value 
of r = 0.25 and f3 — 0.457 for the simulation. Increasing the number of modes TV' improves the accuracy of 
the maximum likelihood values up to a point, but the inclusion of large eigenvalue modes that carry information 
mostly from nonlinear scales can skew the result away from the true value. 



quantified and controlled through the choice of the number of moments retained. 

In Fig. 3 we show the window functions for selected moments in order of increasing eigenvalue. 
This demonstrates that selecting moments that are least sensitive to small scales generally 
results in moments that are most sensitive to large scales; window functions of moments with 
larger eigenvalues have successively larger amplitudes on nonlinear scales as expected. Thus the 
information contained in large eigenvalue moments comes mostly from scales where fluctuations 
are nonlinear and should not be included in a linear analysis. 

In Fig. 4 we show the results of the likelihood analysis on a typical catalog for various N'. 
For reference, we also give the value of A9 q for each N' . We see that in this case, inclusion of all 
of the information leads to the location of the maximum likelihood being skewed away from the 
true values. However, when higher order moments are discarded, the location of the maximum 
likelihood corresponds well with the true values. The fact that the discarding of higher order 
moments leads to a much better agreement between the maximum likelihood location and the 
true values is a good indication that our analysis method is effectively removing small-scale, 
nonlinear velocity information. 

Although proper distance surveys present many challenges to cosmologists, they are quite 
well suited to the extraction of cosmological information provides they are handled with care 
and caution. In this conference I and my colleagues showed that with the right approaches 
we can extract interesting and important information and that techniques we have developed 
should be added to the cosmological toolbox. 
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